1 

Control Communication Complexity of 
Distributed Actions 

Wing Shing Wong and John Baillieul 

Abstract 

Recent papers have treated control communication complexity in the context of information-based, 
multiple agent control systems including nonlinear systems of the type that have been studied in 
connection with quantum information processing. The present paper continues this line of investigation 
into a class of two-agent distributed control systems in which the agents cooperate in order to realize 
common goals that are determined via independent actions undertaken individually by the agents. A 
basic assumption is that the actions taken are unknown in advance to the other agent. These goals can 
be conveniently summarized in the form of a target matrix, whose entries are computed by the control 
system responding to the choices of inputs made by the two agents. We show how to realize such 
target matrices for a broad class of systems that possess an input-output mapping that is bilinear. One 
can classify control-communication strategies, known as control protocols, according to the amount of 
information sharing occurring between the two agents. Protocols that assume no information sharing on 
the inputs that each agent selects and protocols that allow sufficient information sharing for identifying 
the common goals are the two extreme cases. Control protocols will also be evaluated and compared 
in terms of cost functionals given by integrated quadratic functions of the control inputs. The minimal 
control cost of the two classes of control protocols are analyzed and compared. The difference in the 
control costs between the two classes reflects an inherent trade-off between communication complexity 
and control cost. 

Index Terms 

Information-based control system, Control communication complexity, Brockett-Heisenberg system 

1. INTRODUCTION 

In [|29ll and ll30l the authors proposed the concept of control communication complexity as 
a formal approach for studying a group of distributed agents exercising independent actions 
to achieve common goals. For distributed cooperative systems, it is natural to expect that 
communication can help improve system performance, such as reducing the control cost. In 
this paper, we demonstrate how the concept of control communication complexity can lead to 
an inherent estimate of the value of communication bits in reducing the control cost. 
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Information-based control theory aims to deal with systems in which the interplay between 
control and communication are closely intertwined. For some early work see for example ifTOl . 
E], fl32l, OH, Ha. IH, [El, and [|28]|. In this paper, we investigate information-based systems 
controlled by two distributed agents. In particular, we focus on cooperative control, the goal of 
which is for the agents, Alice and Bob, to induce a system output that depends jointly on the 
controls they independently select from their respective finite sets of control inputs. The problems 
treated below have extensions to settings in which the number of agents is larger than two, but 
such extensions are beyond the scope of the present paper. 

The concept of multiple distributed selections of control actions from a specified set of possible 
choices has not received much attention in the control literature until recent work by the authors. 
While we believe this perspective is novel, there are connections with earlier work on cooperative 
decision making such as in the team decision problems treated in lfl"5l and 0}. It also makes 
contact with (but differs from) a substantial body of work that has been devoted to extending the 
concepts of centralized control to the treatment of distributed and multi-agent systems. Space does 
not permit a complete survey, but relevant work includes 11341 where the exchange of information 
is modeled and then used to reduce a certain non-classical stochastic optimization problem to 
a classical one. The recent papers [35] and [|22ll address problems of distributed control with 
feedback loops closed through networks of communication-constrained data channels. 

The aims and fundamental problems encountered in our work on control communication 
complexity are substantially different. In what is reported below, the objective has been to extend 
ideas of communication complexity theory such that the cost of both communication and control 
is explicitly modeled and taken into account. The models being proposed are abstractions of a 
variety of physical processes arising in diverse applications such as controlled quantum spin 
systems and motion control of robotic vehicles. Our goal is to understand the general principles 
underlying communication by means of the dynamics of such systems. 

While the distinction between our work below and previous work on decentralized control is 
real, the differences can be subtle as in the following example. Suppose that two agents wish 
to find touring strategies in order to meet as quickly as possible, as exemplified by the scenario 
of a mother and her child separated in a crowded park. For that problem, a practical solution, 
not necessarily optimal, is to ask the child to stay in one place and for the mother to conduct 
a complete tour of the park. For a more complex situation, consider Alice and Bob who jog in 
the same park at the same time every day. By tacit understanding, they wish their paths to cross 
or not to cross according to their moods (the choices) as prescribed in the following table 





Alice in good mood 


Alice in bad mood 


Bob in good mood 


Paths cross 


Paths do not cross 


Bob in bad mood 


Paths do not cross 


Paths do not cross 



Table 1 

If Alice and Bob can call each other to communicate their choice of inputs, the problem is 
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trivial to solve. However, if direct communication is not available or allowed, a basic question is 
whether it is possible for Alice and Bob to follow different tour paths based on their moods to 
accomplish the stated objective. Moreover, if multiple feasible solutions exist, we are interested 
in identifying those that are optimal with regard to an appropriate metric, such as the total energy 
expended by the agents over a period of time encompassing many jogs in the park. 

Although Table 1 bears resemblance to the payoff functions in classical game theory, one 
cannot over-emphasize the point that there is no optimization of the table values in our problem 
formulation. 

A second example that is more closely aligned to the model studied here is the mobile sensor 
network positioning problem introduced in [|29ll . Consider a mobile sensor network, such as a 
network of remote sensing satellites. The sensor network serves two agents, Alice and Bob; each 
of them wants to monitor a geographic region of interest selected from a pre-defined list. The 
agents do not communicate to each other directly; in fact, they may not even know the identity 
of the other party. If for any given ordered pair of choices there is an optimal configuration 
for the sensors, it is natural to investigate whether one can design communication and control 
strategies, known as control protocols, for the agents to jointly maneuver the mobile sensors to 
the optimal configuration based on their own individual control selections. Moreover, if such 
feasible control protocols do exist, a second objective is to find the optimal protocols under 
appropriately defined performance measures. 

To fix ideas for subsequent discussions, there is assumed to be a control system with two 
input channels, one to be used by each of the two agents. The case of larger numbers of agents 
will be considered elsewhere. Throughout the paper we index the finite collection of control 
inputs that Alice can send to the system by a set of labels A = {1, . . . , m}. Similarly, a finite 
set B = {l,...,n} is used to label the controls available to Bob. It is assumed that over 
many enactments of the protocol the inputs used by Alice and Bob will appear to be randomly 
chosen samples from uniformly distributed random variables with sample spaces A and B. If 
one represents the target output when Alice chooses a = i and Bob chooses (5 = j by Hy, then 
the m-by-n matrix 

H = [Hy] (1.1) 

provides a compact representation of the set of target outputs for all possible choices of inputs 
and will be referred to as the target matrix. 

While the structure of control protocols will be explained in the following section, a basic 
observation is that the input labels of the agents, a E A and (3 e B, are key inputs to these 
protocols. Once a and (3 are specified, following the basic premise of lfT6ll . the control protocol 
is assumed to run to completion. For a control protocol, V, let x(V(i,j),t) represent the state 
at time t when a = i and /3 — j. If the system output mapping is represented by F, the 
feasibility problem of protocol-realizing control is to determine whether it is possible to design 
a control protocol, V, so that at the termination time, T, the following condition is satisfied for 
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all z £ {1, ... , m} and j 6 {1, . . . , n}: 

F(x(P(z,j),T)) = H JJ , (1.2) 

Solutions to the feasibility problem may involve control protocols that require observations 
of the system state; even though there is no direct communication links between the agents, it 
is possible for them to signal to one another via the dynamic system. An example of such a 
control protocol was discussed in OOll . The number of bits exchanged during the execution of a 
control protocol is a useful indicator of its complexity. On the other hand, control protocols can 
also be evaluated in terms of the control cost incurred, for example as measured by the control 
energy required. Although communication complexity and control cost seem unrelated, we will 
demonstrate that there is a close relation between the two. 

To make our results concrete, we will focus on a class of systems whose input-output mappings 
are bilinear. A prototypical example of such a system is the Brockett-Heisenberg system, which 
we hereafter refer to as the B-H system. The system is arises in sub-Riemannian geometry 
([4]), and it shares essential features with models arising in nonholonomic mechanics ( Il2"6ll . ll27l0 
and quantum mechanics ( [[301 ). For B-H systems one can characterize exactly when protocol- 
realizing control problems have feasible solutions in the absence of any communication between 
the agents. Moreover, when a problem is feasible it is possible to determine the minimal control 
cost needed to achieve it. On the other hand, in the case that the agents have partial information 
about each other's choice of inputs, it will sometimes be possible to decompose the control 
communication problem into simpler parts on each of which a smaller control cost can be 
calculated. The concept of e- signaling will be introduced, by means of which the agents may 
share partial information with each other. Pursuing this idea, it will be possible to estimate the 
minimal control costs for two extreme scenarios, one without side communication or partial prior 
knowledge and the other with enough communication to allow the players to precisely compute 
intermediate results regarding the target matrix. The two extreme scenarios suggest a natural 
framework for appraising the inherent value of a communication bit in terms of control cost. 

The organization of the paper is as follows. In section 2, we provide a description of the basic 
model as well as the definition of key concepts. In section 3, background results on a bilinear 
input-output system are presented. In section 4 we describe how to transform the optimization of 
a single round protocol into a matrix optimization problem, the solution to which is presented in 
section 5. Implications of the result for understanding the trade-off issue between communication 
complexity and control cost is explained in section 6. Multi-round protocols are discussed in 
section 7. Section 8 provides a brief conclusion of the paper. 

2. THE BASIC MODEL 
The dynamical systems considered here are inherently continuous time systems of the form: 
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^x(t) = d(x(f), U (^(i)) ) x(0) = x GR ff , 

z(t) = c(x(i)) G R, 
where d is an arbitrary smooth vector field and u and v are scalar control functions that, once 
chosen, are applied over a time interval of standard length T. The output z(-) is sampled at 
discrete time instants t < t\ < . . . , where t = and tfc+i = tk + T. Information about the state 
is made available to the two agents through encoded observations b^(x(t)) and b#(x(t)) which 
are made at the same time instants. The fact there are standard intervals of time over which 
observations and selected control inputs are applied allows the analysis to make contact with 
prior work on information-based control of discrete-time systems. Under our assumptions, we 
thus consider the following simplified version of the model introduced in [|29l . For k — 1, 2, . . ., 
let tk represent the fixed time when observations are taken. Define 

x fc = x(t fc ). (2.2) 

Observations of the state, b^(x k ) and b B (x fc ), are made available to each agent as encoded 
messages, Q k A ' and Q[ B \ consisting of finite bit-length codewords; in other words, the ranges of 
these quantization functions are finite sets. Computation and communication delays in reporting 
observations to the agents are assumed to be negligible. The agents select respective control 
actions, Uk = P k {Q k ,oi) and t> fc = P k '{Q[ B \ (3) , where represents the set of coded 
observations {(^(b^Xfc)), . . . , (^(b^Xi))}; Q k is defined similarly. Note that the arguments 
of u k and Vk are defined on finite discrete sets. It is assumed that for each element in the 
corresponding argument set, Uk identifies a unique element in a set of admissible Lebesgue 
measurable control functions over [tk,tk+i); similar assumption holds for v k . If one imposes a 
probability measure on the sets of choices, these controls may be interpreted in terms of the 
observation sigma algebras over the corresponding probability space. However, this probabilistic 
interpretation is not essential to the results presented in this paper. 

For reasons of simplicity, we assume that the controls u and v are scalar functions. The 
codewords identifying the selected controls are transmitted to the dynamic system. Computation 
and communication delays associated with this step are also assumed to be negligible. Thus at 
time tk the dynamic system can determine the control selected by Alice and Bob. 

The state transition (2.1 ) between times tk and tk+i is thus described by the following discrete 
time control model where the controls are square integrable scalar functions: 

x fc+ i = a(x fc , u k , v k ), x(0) = x G R N , 
y[ A) = b A (x fc ) G R lA , y[ B) = b B (x fc ) G R iB , 

u k = Pk A (Ql A \a),Vk = P k B (Qi B) ,P), { • } 

z k = c(x fc ) G R. 

The quantity z k = c(x fc ) is a global system output that is observable to Alice, Bob, and 
possibly to exogenous observers as well. The protocol parameters, a and (3, are specified at 
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time t an d remain unchanged while the protocol runs to completion. The case where these 
parameters are allowed to change over time is an interesting extension which is not considered 
here. 

Definition 2.1. A control protocol, V, consists of the functions: 



{Q { ^= AQi B) }T=o,{Pi A) }^ and {Pf >} 



oo 

fc=0- 



We define the epoch between time tk and tk+i as round k + 1. In round k + 1, an agent 
first observes and then selects a control to be applied to the system. The selected controls 
u k = Pk(Qk A) (b A (xk)), «) and v k = Pj? (Q k B) (h B (x k )), f3) will typically be time- varying in the 
epoch between time t k and tk+i, but they depend only on the encoded value of the state at time 
tk- Hence, the agents use essentially open-loop controls during the round, but the selection of 
the control at the beginning of the round can depend on partial state information. 



Definition 2.2. Consider a dynamic system, {2.3), with parameters defined by (a, h A , h B , c, x ). 
A target matrix H is said to be realizable at termination time Tf if there exists a k-round 
protocol, such that 

tk = Tf, 

and for any choice of indices i E A and j E B, the k-round protocol defined with a = i and 
(3 = j terminates with 

c(x fc ) = H^-. (2.4) 

The initial state is assumed to be fixed and known to the agents. For the first round, there 
is no need for any communication to the agents. That is, we assume the quantization functions 
Qq and always take the same value for all control choices and need not be transmitted. 

Because a target matrix may be realized by different protocols, there is an interest in identifying 
those that are optimal with respect to some performance measures. One such measure is to count 
the number of communication bits exchanged during the protocol execution. To make this precise, 
[[291 introduced the concept of control communication complexity by extending the concept of 
communication complexity that was introduced in computer science by Yao 11331 . Briefly speaking, 
given a dynamic system and a target matrix, one defines the protocol complexity of a feasible 
protocol to be the maximum number of bits exchanged by the agents in running the protocol to 
completion. The control communication complexity is then the minimum protocol complexity 
over the set of all feasible protocols. A caveat: unlike classical communication complexity, 
control communication complexity is defined with regard to a fixed dynamical system. 

In the models considered in the present paper, control inputs are square integrable functions, 
and this suggests measuring the complexity of a protocol in terms of the integral of a quadratic 
function of the control. This makes contact with performance measures commonly used in 
centralized control. Indeed, control communication complexity provides a rich new class of 
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optimal control problems. While the relation between communication complexity and integral 
control cost may not be transparent, the analysis of the B-H system in ll30ll demonstrates that 
there is an intriguing relation between them. Intuition suggests that protocols in which there is 
limited communication between the agents except for their common observations of the system 
dynamics may require a larger integral control cost than those that employ a large number of 
communication bits in addition to the system observations. If the agents have partial information 
about each others choices of inputs, control laws can be more precisely tailored. A contribution of 
this paper is to take the first step towards analyzing this trade-off by comparing two limiting types 
of control protocols, namely, single round protocols which entail no communication bits, and 
protocols in which agents share partial information about which elements a E A and (3 E B are 
governing the execution of the protocol. We provide a fairly complete treatment of single round 
protocols, but multi-round protocols are less well understood at this point. As a lower bound 
on the control cost, we assume that it is possible for the agents to communicate their choices 
of inputs to each other without incurring any control cost or state change, thereby decomposing 
the original problem into a sequence of standard optimal control problems. The difference in 
the minimal control costs between this type of protocol and the single round protocols having 
no communication provides an upper bound on the saving afforded by the bits of information 
that are communicated between the two agents as described in [|29l . 

In the current model the only unknown parameter to Alice is Bob's choice of control input 
and vice versa the only unknown parameter to Bob is Alice's choice of input. The single choice 
problem in which each agent has only one choice of control input is equivalent to the scenario 
where the agents have complete information on the system. Since the initial state is known and 
there is no state noise, optimal distributed control can be devised without resorting to any state 
observations. In other words, for optimizing single choice problems, we only need to consider 
single round protocols. 

For the rest of the paper we focus on scalar output functions to simplify the analysis. Vector- 
valued outputs, as necessitated by the sensor network example, present a significantly larger 
technical challenge, and will not be discussed here. 

To analyze the cost of letting our control system evolve under different input curves in single 



round protocols, we lift the evolution dynamics (2.3) back up to the continuous domain as 



described by equation (2.1 ) but with c replaced by a scalar function. 

Unless stated otherwise, the controls u and v exercised by the agents are assumed to lie in a 
closed subspace C C L 2 [0,T]. Let C® C represent the tensor product Hilbert space with inner 
product defined by 

< U\ ® V\ , U2 ® v 2 >=< Ui, u 2 >< Vi, v 2 > ■ (2.5) 



At time T, the input-output mapping of system (2.1 ) can be regarded as a functional from C®C 



to K, denoted by F. F of course depends on the initial state x but as the state is assumed to be 
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fixed and since for the time being we consider only single round protocols, this dependency can 
be hidden for simplification. Without loss of generality we assume that to = and t\ = T = 1. 
F is a bounded functional if there exists a finite ||F|| so that for all (u,v) E C® C, 

\F(u,v)\ < \\F\\\\(u®v)\\ mc = \\F\\\\u\\ c \\v\\ c . (2.6) 

To realize a given target matrix H, the optimal controls in general depend jointly on the 
parameters, a and (3. However, if the agents make their choices independently and there is 
no communication between them, u can only depend on Alice's parameter a and v on Bob's 
parameter f3. 

A target matrix H is realized by a single round protocol V if there exist sets of controls, 
U = {iti, . . . , u m } and V = {v i, . . . , v n }, so that 

F(u i ,v j ) = H ij . (2.7) 

We emphasize that F represents the system output at time T. Such a single round protocol 
solution may not always exist as we will see in subsequent sections. The problem may become 
feasible if Alice and Bob can exchange information about their choices to each other. 

Given the system (2.1 ) and the parameter sets A and £>, the cost of a single round control 
protocol V is defined as an average of the required control energy, given explicitly by the formula, 

I(U, V) = - V / ul{t)dt + -y,f v*(t)dt. (2.8) 
m Jo n ~tJo 

One can also write equation ( |2.8| ) in the form, 

i=i j=\ V^o Jo J 

which highlights the fact that the control cost is averaged over all possible event outcomes based 
on the control actions that are chosen by the agents. 

In subsequent sections, we compute the minimum averaged control energy for an arbitrary 
target function H. That is, our aim is to compute 

C F CH) = min I(U,V) (2.10) 

u,vcc 

subject to the constraints that for i = 1, . . . , m and j = 1, . . . , n, 

F(u i ,v j ) = H ij . (2.11) 

Before concluding this section we note that there are some similarities between the cooperative 
control communication protocols studied in this paper and more classical dynamic game strategies 
as studied in, say, [3]. Yet there are fundamental differences; for example, optimization of payoff 
functions is not the focus of our investigation. Using the rendezvous problem as an example, 
once the moods of the the agents are fixed, the outcome to be achieved is automatically defined 



February 2, 2012 



DRAFT 



9 



by the target matrix. The investigation focus is on how to ensure that the target objective — paths 
crossing or not crossing — can be guaranteed. The work here also has some connection with 
the many papers in the literature dealing with distributed control of mobile agents, multi-agent 
consensus problems, and classical team decision theory. See, for instance, ll8ll,ll^, lfTT1l . irT4l , [[T5l , 
and [24J. In these papers the dynamics of the subsystems controlled by the agents are usually 
not tightly coupled and the control cost is not explicitly calculated, unlike the models we are 
considering here. Moreover, allowing agents to select controls from sets of standard inputs is 
also a fundamental point of departure. 

3. SYSTEMS WITH BILINEAR INPUT-OUTPUT MAPPINGS 
The simplest system defined in Section 2 is probably of the following linear type: 

dx(t) 

= Ax(i) + u(t)bi + v(t)b 2 , x G R N , 

y A (t) = y B (t) = z(t) = c T x(t) g R. 

The input-output mapping of this system is affine in (u,v). Moreover, it is easy to check that 

if F(ui,v k ) = H ik , F(ui,vi) = H a , F(uj,v k ) = H jk , and F(uj,vi) = H jt for some u h u jt v k , 
and vi, then 

Hik — H jk = H u — Hji. (3.2) 

Thus, there are severe restrictions on the set of realizable target functions for such an input- 
output mapping. A class of distributed control systems that realizes a richer class of input-output 
mappings has two independent input channels entering the system in a jointly bilinear fashion. 
More precisely, we have the following definition: 



Definition 3.1. Consider a system defined by equation ( 2.1 ) with control functions defined in C, 
a closed subset of L 2 [0,T]. The system is a bilinear input-output system if for any time t > t , 
the output at t, regarded as a mapping (u(-),v(-)) i-> z(T) from C® C to R is bilinear in the 
control function ordered pair (u,v). 

In state-space representation, two prototypical classes come to mind. The first class is: 

^ = Ax(t) + u(t)v(t)h, x(0) = G R N , (3 3) 

y A (t) = y B (t) = z(t) = c T x(t) G R, 
for some matrix A and vectors b and c. 
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A second, in a way more important class, comes from the Brockett-Heisenberg (B-H) system 
and their generalization, 0|,ED. The B-H system, denoted by can be described as follows. 



d 
db 



\ 



( x 

y 

\ z ) 



( 



\ 



u 

V 

vx — 



uy J 



( x(0) \ 

y(o) 

V z(0) J 





V / 



(3.4) 



VA® = y B (t) = z(t) = c((x(t),y(t),z(t)). 

The fact that the input-output mappings of these systems are bilinear in u and v can be easily 
verified. Note that in both cases we could take C to be L 2 [0, T] which results in a bilinear input- 
output function with an infinite dimensional matrix representation. On the other hand, we could 
also restrict C to some finite dimensional subspace of L 2 [0,T] if only certain control functions 
are allowed to be used in the system. 

While our main results make use of properties that are to some extent particular to (3.4), it is 
important to note that the system (3.4) has features of an intrinsically geometric nature in common 
with a much larger class of two-input "drift-free" control systems whose output mappings are 
not necessarily bilinear functionals of the inputs. These systems arise in applications including 
the kinematic control of nonholonomic wheeled vehicles (See, e.g. ifTTI .) and the control of 
ensembles of spin systems arising in coherent spectroscopy and quantum information processing. 
(See [13.) To illustrate how important features of a broad class of two-input systems are held in 
common with (3.4), we consider the Bloch equations in a spatially fixed frame in which there is 
an rf-inhomogeneity but no Larmor dispersion. The system is governed by ordinary differential 
equations evolving on the rotation group SO (3). 



X 



where 



a, 





X(0) 




I- 


( \ 









10 0-1,^ = 


(• 





I) 


V 1 / 










(3.5) 




and Q 2 



are the infinitesimal generators of rotations about the x-,y-, and z-axes in IR 3 . To briefly explore 



the connection between (3.4) and (3.5), we recall the area rule. 



Lemma 3.1. (Brockett, [|6||.) Let u, v e L 2 [SO(2)] be as above, and let x,y be defined by {3.4) 



Then x, y e L 2 [SO(2)], and z(-) as defined by (3.4) satisfies 



z(l) 



v(t)x(t) -u(t)y(t)dt 



xdy — ydx = 2A, 
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where A is the signed area enclosed by (x(t),y(t)). 



The connection between (3.4) and (3.5) is made by the following 



Example 3.1. Let the inputs 

u{t) = ecos(27rt), v(t) = esin(27rt) 
be applied to both ( |3.4[ ) and ( |3.5[ ). For ( |3.4[ ), corresponding to these inputs, 



x(t) 



2tt 



sin(27rt) and y(t) 



2tt 



cos(27rt). 



By Lemma 3.1 (or a direct calculation), the output of (3.4) is z{l) = e /(2ir). For this choice 



of input, (3.5) may be solved explicitly yielding 

X{t) = exp 





exp 



Note that the first factor in this expression has period 1, while the second factor is periodic with 



cos 9 sin 9 
— sin 9 cos 9 



period T = 1/^1 + (e/(2ir)) 2 . At time T, 

X(T) = 











1 



where 6 = — (l/2)(e 2 /(27r)) + 0(e 2 ). Hence, the output of (3.4) provides a good approximation 
of the ^-axis rotation 9 = —z(l)/2 resulting from application the inputs u, v to (3.5) over the 
time interval [0, 1/ yT+ (e/(27r)) 2 ]. We refer to ll30l for further information regarding area rules 



and approximate motions of (3.5) on SO (3) and to [17] and [|7) for information on the general 
application of area rules in two-input control systems. As detailed in [|20l , systems of the form 
( |3.5 ) can be used to guide the design of rf pulses for quantum control experiments. While 



( |3.5[ ) does not have the same natural bilinear input-output structure as ( |3.4[ ) the approximate 
relationship between the two system responses to closed curve inputs augurs well for potential 
applications of the computability results to be established in the remainder of the paper. 

With such broader applications of B-H systems in mind, we confine our attention in the 



remainder of the paper to studying the response of (3.4) to those closed input curves (u(t), v(t)) 
that give rise to closed curves (x(t),y(t)). Recall that U = {u\, . . . , u m } and V = {v 1 , . . . ,v n }, 
represent the sets of the control actions. We assume the controls are elements in C, a closed 
subspace of L 2 [0, 1] consisting of functions that can be represented by the following type of 
Fourier series with square summable coefficients: 

Ui(t) = V^SfeLi [ a i,2k-i sin(27rH) + a^k cos(27rH)] , 
Vj(t) = V2J2™ = i [-b jt 2k-i cos(2irkt) + b j}2 k sin(27rfct)] . 



(3.6) 
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This restriction is not essential to the investigation reported in this paper, but it allows connection 
to the results presented in ll30l and 0. Note that C contains continuous periodic functions with 
zero mean and period one. From basic orthogonality properties of the sine and cosine functions, 
one can show: 



Lemma 3.2. If the controls Ui and Vj are used, then at time t — 1, 



Oi,2fc-l^',2fc-l 



*(!) = E fc =i — + E fc =i 



ilk 



irk 



Moreover, 



a i,2k-l + a l 



/ ui(t)dt = j2 
J ° fc=i 

„i oo 



(3.7) 



(3.8) 



(3.9) 



The proof is a straightforward calculation and is omitted. Define matrices Ub, V b , and F B 
as follows: 



Ut 



a l,l a l,2 a l,3 

«2,1 «2,2 0-2,3 

Om,l Q"rn,2 <^m,3 • • • 

b\,l 6i, 2 &1,3 • • • 

^2,1 &2,2 ^2,3 • • • 



b n ,l &n,2 ^n,3 



(3.10) 



(3.11) 



1 

7T 



1 



1 













2" 1 







2- 1 












3- 1 










3- 1 



(3.12) 



(3.13) 



Lemma 3.3. The output of the B-H system is given by UbFbVb T . Moreover, 

m „i n „x 

V / u^dt = trU B U B T , V / v 2 t {t)dt = trV B V B T . 
i=x Jo i=l Jo 

The proof is straightforward and is omitted. It should be noted that the weight assignment 
and the indexing in (3.6) are chosen to allow the matrices U B ,V B , and F B to assume these 
simple representations. The control cost defined in (2.8) can be rewritten as 



'he control cost defined in (2.8) can be rewi 

I(U, V) = f-trU B U B T + -trV B V B T ) 

\m n J 



(3.14) 
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These results can be generalized to any system where the input-output mapping, F, is a 
bounded functional on C ® C. Let {e±, e 2 , . . .} and {/i, / 2 , . . .} be orthonormal bases (possibly 
the same) for £ with respect to the standard inner product on L 2 [0, 1]. Let L be the order of C, 
which could be infinity. If u — J2i=i r i e i an ^ v = J2j=i s jfj> by the bounded bilinear 

L L L 

F{u, v) = F(Y^ ne h s ifj) = Yl r i s J F ( e i' fj)- ( 3 - 15 ) 

i=l j=l i,j=l 

We can represent U{ Eli and v 3 - E V as L dimensional row vectors by means of their coefficients 
with respect to these orthonormal bases. Let U be the m by L matrix whose i-th row is the vector 
representation of Ui and V be the n by L matrix whose j-th row is the vector representation 
of Vj. Under these bases, F has an L by L dimensional matrix representation: F = [F(e i: fj)] ; 
that is, F(ei, fj) = F {j . 

Proposition 3.1. The condition that the targets are achievable can be expressed by the equation: 

H = UFV T . (3.16) 

The proof is straightforward and is omitted. 

As usual, define the rank of F to be the number of its independent columns (or rows). The 
rank of F is said to be infinite if there is no finite subset of columns (rows) in terms of which all 
columns (rows) can be expressed as linear combinations. One can show via arguments presented 
in Appendix A that the rank is independent of the bases chosen. Hence, we can speak of the 
rank of a bilinear map without ambiguity. 

Proposition 3.2. There exists a single round protocol that realizes an m-by-n target matrix, H, 
of rank k, if and only if the rank of the bilinear input-output mapping F is at least k. 

Proof: Let H fc be a full rank k-by-k sub-matrix of H. If there is a single round protocol to 
realize H, we can select controls from the solution protocol to obtain a realization for H fc . 
Conversely, if a single round protocol exists for H fc , we can extend it to obtain a protocol for 
realizing H by adding control functions to the control sets according to the additional choices 
allowed; these added controls can be constructed as linear combinations of the original controls 
used in realizing H fc . Hence, we can assume without loss of generality that m = n = k and H 
is a k-by-k full rank matrix. 

Suppose a single round protocol exists for such a target matrix. The solutions consist of k 
controls for Alice and k for Bob. We can construct an orthonormal basis for C so that the 
controls used by Alice are spanned by the first k basis elements. Construct similarly a basis for 
Bob. Represent F in terms of these bases and let F denote the restriction of F to the first k 
basis elements. Then, F is a k-by-k matrix and the equation 

H = UFV T (3.17) 
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has a solution. It follows that the rank of F, and hence, F is at least k. 

Conversely, if there exists a A; -by- A; matrix F of full rank representing the restriction of F to 



some control subspaces, equation (3.17) always has a solution for any £>by-A; matrix, H. 



The previous proposition implies that the existence of a target realizing single round protocol 
depends on the rank of the bilinear input-output mapping and the rank of the target matrix. For 
a bilinear input-output mapping of finite rank, there exists target matrices that cannot be realized 
without communication between the agents. For a bilinear input-output mapping of infinite order, 
any finite dimensional target matrix can be realized by a single round protocol. 

Given a matrix representation, F, for j = 1, 2, . . ., define its j -dimensional leading principal 
minors by: 



F U ... F, 



j-j 



(3.18) 



Definition 3.2. A matrix representation of a bilinear map is regular if for all positive integers 
j less than or equal to its rank, its j -dimensional leading principal minors are nonsingular. 

Definition 3.3. A matrix representation of a bilinear map is strongly regular if it is finite 
dimensional and regular or if it is in diagonal form such that for all positive integers j the 
diagonal elements satisfy the ordering 

F jd > F j+W > 0. (3.19) 

It is easy to find examples of finite dimensional regular or strongly regular matrix representa- 



tions. For the B-H system, the input-output mapping that takes (u,v) to z(l) by means of (3.4) 
has a matrix representation F B described by ( 3.12[ ) with respect to the orthonormal basis, 



B = {V2sin(27rt), >/2cos(27rt), v 7 ^ sin(47rf), 

\/2cos(47rt),...}. (3.20) 

The matrix is diagonal of infinite rank and provides an infinite dimensional strongly regular 
representation of F. 

To prepare for subsequent discussions we recall some basic ideas on singular values. A p-by- 
q matrix, M, has p singular values while M T has q singular values. It is well-known that the 
nonzero singular values of the two matrices are identical (see [ |2T| for details). We denote by 
<7j(M) the z-th singular value of M under the ordering 

ai(M) > <r 2 (M)... >a p (M). (3.21) 

If M is a p-by-p symmetric matrix, represent by Aj(M) the i-th eigenvalue under the ordering 

Ai(M) > A 2 (M)... > Ap(M). (3.22) 
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Lemma 3.4. Consider a k-by-k matrix, M, with a matrix decomposition so that 

M - 



My Mi, 2 

M 2 i M 2 , 2 



(3.23) 



where M^i is l-by-l, Mi >2 is l-by-(k — I), M 2) i is (k — l)-by-l, and M 2 , 2 is (k — l)-by-(k — I). 
Then, for i — 1, . . . , I 

<7i(M) > ^(Mix). 



(3.24) 



M 1>2 MJ 2 
M 2>2 M 22 



Proof: Consider the following positive semi-definite matrix: 

MyM^i + M 1)2 Mf 2 Mi^Mjj - 
M 2i iM^ + M 2>2 M[ 2 MyM^ - 

It follows from Fischer's Minimax Theorem (page 510, A.l.c [21]) that for i — 1, 

Aj(MM T ) > AjfMyMy + Mi, a M^)- 

By means of a result of Loewner (page 510, A.l.b [|2T1l ): 

A i (Mi )1 M^ 1 + M 1)2 Mj 2 ) > Ai(Mi i iM^ 1 ). 



(3.25) 



(3.26) 



(3.27) 



The result then follows by combining these two inequalities. ■ 
We conclude this section by noting that if a matrix representation, F, is finite dimensional, 
then its singular values are well-defined and we use <Tj(F) to represent the i-th largest singular 
value. One can extend this concept to the infinite dimensional case in the following way. For 
any matrix F and corresponding sequence of principal minors (Fi,F 2 ,...) (3.18), it follows 
from the preceding discussion that 



<Xi(Fj) < <7i(F, + i). 



(3.28) 



On the other hand, it is shown in Appendix B that <7j(Fj) < for all i. Hence the limit 
Hindoo (Tj(Fi) exists and is finite. Denote this limit by <7j(F). If F is a strongly regular infinite 
dimensional representation then clearly for all integers i and I, erj(F) = Cj(Fj). 

4. MATRIX REPRESENTATION OF THE OPTIMIZATION PROBLEM 



(4.1) 



The control cost of U and V defined in ( |2.8[ ) can be rewritten in matrix form as 

— trUU T + -trVV T . 

m n 

Our goal of finding the optimal control to realize a given target matrix can now be represented 
as a matrix optimization problem. One is also interested in approximate solutions restricted to 
finite dimensional control subspaces. In particular, for any positive integer /, satisfying I > 
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max(m, n), consider the following optimization problem involving the /-by-/ leading principal 
minor of F, F ; : 

Optimization Problem (H, Fj): Let U and V be m-by-l and n-by-l matrices respectively. The 
optimization problem is defined by: 



subject to the constraint: 



min ( — trUU T + -trVV T ) (4.2) 
u,v \m n 



H = UF Z V T (4.3) 



While it is not clear whether optimal sequences of controls for the above problem always exist, 
the infimum cost, denoted by CV(H), is always well-defined if the target matrix is realizable. 

One can formulate a slightly more general version of this optimization problem by allowing 
the weights in the cost function to be arbitrary positive integers: 

Generalized Optimization Problem (H, F t ;p, q): Let H be an m-by-n target matrix, F z be an 
/-by-/ leading principal minor of F, with / > max(m, n), p and q be arbitrary positive integers. 
The optimization problem is defined by: 



subject to the condition 



min ( -trUU T + -trVV T ) (4.4) 
u,v V p q 



H = UF;V T (4.5) 



There is an important connection between these two classes of problems. Given an m-by-n 
target matrix, H, one can obtain another target matrix by appending rows of zeros and columns 
of zeros to obtain an /-by-/ matrix, with / > n, I > m, so that 



H 



(4.6) 



H Om,i— n 

Oz— m,n Oi— m,l— n 

where Oj j is an i-by-j matrix with all zeros. 

Now consider the Generalized Optimization Problem (H, F z ;m, n). Any optimal solution to 
the Generalized Optimization Problem must satisfy the property: 

m = 0, Vj = (4.7) 

for % > m and j > n. Otherwise, a lower cost solution can be obtained by substituting with 
these zero controls. From this, one can conclude that the solution is also optimal for the lower 
dimensional Optimization Problem, (H, F t ). Conversely, an optimal solution to the latter problem 
can be extended to an optimal solution to the Generalized Optimization Problem (H,F z ;m, n). 
Hence, by using the Generalized Optimization Problem formulation, we can assume without loss 
of generality that the target matrices are square matrices with the same dimensions as F t . 
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5. SINGLE ROUND PROTOCOLS: MINIMUM ENERGY CONTROL 
One of the key results in this paper is summarized by the following theorem. 

Theorem 5.1. Consider a bounded bilinear input-output mapping, F, with a regular matrix 
representation F with rank s. Let H be an m-by-n target matrix, such that s > max(m, n). The 
infimum control cost of any single round protocol that realizes H is given by: 

min(m,n) 

C F {H) = ^= *fc(H)/<7 fc (F). (5.1) 



I ran 



fc=i 



If the matrix representation is strongly regular, there exists a single round protocol that achieves 
this infimum control cost. 

Before proving Theorem |5.1[ we present a corollary that specializes the result to the B- 
H system ( |3.4| ). First note that the diagonal entries of F B in ( |3.12[ ) can be represented as 
(\k/2]n)~ 1 , and this leads to the following result. 



Corollary 5.1. Consider the input-output system (3.4) and an ra-by-n target matrix H. For 
single round protocols that realize H the infimum control cost is given by 

min(m,n) 



27T 

(7 Fr (H) = — r*/2>*(H), 



'ran 



(5.2) 



k=l 



and this is achieved by Alice choosing ra controls and Bob choosing n controls from the space 
spanned by the basis Bo defined in ( 3.20 ). 

To prove the main theorem, we first establish a proposition in which we prove the result for 
a square target matrix H and for controls restricted to finite dimensional subspaces. 

Proposition 5.1. Consider an invertible matrix F; and an l-by-l target matrix H with rank r. 
The minimum control cost for the Generalized Optimization Problem (H,Fj;p, q) is achievable 
and is given by the formula, 

2 



C Fl (H) = — £V fc (H)/<r fc (F,). 



(5.3) 



k=l 



Proof: To prove the proposition, first of all we show that the right-hand- side of (5.3) can be 
achieved. Let II be the orthogonal matrix that puts F;Ff into the following diagonal form: 

"^(FO ... 



n T F ; Ffn 



o 



of (Fi) J 



(5.4) 



Define U = n T Un and V = n T VIl. Then (U, V) is a solution to 



H = UF,V 



T 



(5.5) 
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if and only if (U, V) is a solution to the equation 

n T Hn = un T F,nv T . (5.6) 

Since the cost of (U, V) and (U, V) are identical, we can assume without loss of generality 
that FiFj is in the diagonal form (5.4). Let © be the orthogonal matrix that diagonalizes H T H 
so that 



0H i H9 J 



(5.7) 



and let 



Ra 



1/4 



^(H^F,) 



/a r (H)a r (F ; ) 







(5.8) 



for a small S > 0. In equation (5.7) the lower right-hand- side zero-diagonal block is absent if 
r = I. Similarly for equation (5.8), the lower right-hand- side 5-diagonal block is absent if r = L 
Define 



Us = H©Rr\ vj = f^HsQ 1 



we obtain a solution to ( |5.5[ ). By direct computation, it follows that 

V 



-trUtUT = -trH0R7 2 T H T 



P 
1 

P 



trRV 2 & T U T U& 



1 r 

— ^t7 fc (H)/t7 fc (F,). 



(5.9) 

(5.10) 
(5.11) 

(5.12) 



k=l 



Since entries of Ua are affine functions of 1/5, equation (5.12) implies Ua is independent of 5. 
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We express this by writing U,5 

1 



Q 



trVsVj = -tr0R 5 (F,F 



q 
1 

<i 
l 



U . It is clear that lim^o V,5 exists. Denote it by V . Then, 
I ... ~ ~. (5.13) 

(5.14) 
(5.15) 



-*rR?(F { F? 



1 r 

—Y 



k=l 



(5.16) 



Taking the limit as 5 — > 

1 

P 



trU U^ + UrVoVg 



£)<7 fc (H)/<7 fc (F, 



(5.17) 



fc=i 



proving that the right-hand- side of (5.3) can be achieved. 

To complete the proof, we want to show the right-hand- side of (5.3) is a lower bound for all 
single round protocols realizing H. We show this first for invertible H. In this case, all solutions 
(U, V) to the equation (5.5) consist of invertible matrices and for every invertible V there is a 
unique matrix U that solves (5.5). By applying polar decomposition to F;V T we obtain 



V T = Fr 1 R0 T , 



(5.18) 



for an orthogonal matrix, ©, and a non-singular symmetric matrix, R. These two matrices can 
be regarded as free variables and we can parametrize the solution space to ( |5.3| ) by R and ©. 
Specifically, if we let U = H0R \ then the control cost is 



-trUU T + -trVV T 
P Q 

= -trR~ 2 T H T H0 + -trR^FFj)- 1 . 
P 1 

For any two l-by-l positive semidefinite symmetric matrices, P and Q 

i i 
trPQ = ^A fc (PQ) > ^A fc (P)A ; _ fc+1 (Q). 



(5.19) 



(5.20) 



fc=i 



(See, for instance, p. 249 of [21].) Therefore, 



fc=l 
I 



(5.21) 
(5.22) 



fc=i 
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Similarly, 
Hence, 



trR *& T H T H& > EL=i^(H)/A 2 fe (R). 



(5.23) 



(5.24) 



fc=l J k=l 

One can minimize the right-hand- side of ( 5.24| ) by considering 



mm 



in 



k=l 



1*2 (H) 



U 



subject to the constraint 



h > t 2 ■ ■ ■ > ti > 0. 



Via calculus, the unconstrained optimal solution to (| 5.24| ) is shown to be 

t k = <7 fc (H)«7fc(Fj) 



which also satisfies the condition in (5.26). It follows that 



C Fi (H)>— £V fc (H)/a fc (F,). 

fc=l 



(5.25) 



(5.26) 



(5.27) 



(5.28) 



By (5.17) the right-hand-side of (5.28) can be achieved, hence the inequality is an equality and 
the proposition holds for invertible H, that is r — I. 

We want to show the inequality (5.28) also holds when H is not full rank. Suppose U and 
V give an optimal solution to (H, Ff,p, q). Define 



U(el 



U + el, V(e) = V + el, 



(5.29) 



where I is the /-by-/ identity matrix. Other than at most a finite set of values, all these matrices 
are invertible. Without loss of generality, assume that for some a > and all < e < a, the 
matrices U(e) and V(e) are invertible and restrict e in subsequent discussion to such an interval. 
Let U(e) denote the set of controls corresponding to U(e) and similarly, V(e) denote the sets of 
controls corresponding to V(e). Define 



G(e) = U(e)FV(ef = H + e(UF, + F,V T ) + e 2 F h 



(5.30) 



For < e < a, the matrices G(e) are invertible. Hence, 
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I(U, V) = -trUU T + -trVV T 
V q 

= lim ( -trU(e)U(e) T + -trV(e)V(e) T 



> hmCWGfe)) 



Q 



^-^Z>*( G ( c ))/**< 

e->0 . /no ' 



fe=l 



£V fc (H)/a fc (F,) 



fc=l 



^a fc (H)/a fe (F i ) = C , Fi (H). 



(5.31) 

(5.32) 
(5.33) 

(5.34) 
(5.35) 
(5.36) 



fc=i 



Note that the equality (534b follows from the fact that the proposition holds for invertible target 
matrices, while the equality (5.35 ) follows from the continuity of the singular values as a function 
of the matrix coefficients. This proves the proposition. ■ 
Proof of Theorem 5.1 If F is finite dimensional, the theorem follows from Proposition 5.1 if 
we take F/ to be F s . Hence, assume that F is infinite dimensional and let {ei,e 2 ,...} and 
{fi, h, ■ • •} be the bases corresponding to the representation of F. Let IA = {ui, . . . , ii m } and 
V = {vi, . . . ,v n } be the controls in an optimal solution for the Optimization Problem (H, F), 
with corresponding matrices U and V respectively. Let 



E 



3=1 



Then 



and 



_ m oo . n oo 



i=l j=l 



i=l j=l 



H = UFV T . 



(5.37) 



(5.38) 



(5.39) 



Define approximating control functions 



a 



(0 



E 

i=i 



(I) 



with corresponding m-by-Z and Z-by-n matrices 

an . . . an 



U, 



V 



61,1 



6i, 
/->/ , 



(5.40) 



(5.41) 



February 2, 2012 



DRAFT 



22 



Then, equation ( |5.38[ ) can be rewritten as 

1 



1 



I(U,V) = — trUU T + -trVV r 

m n 

= lim ( —trViVf + -trVjVf 

i->oo \ m n 



Define the m-by-n approximate target matrix by 

H, = (if®) = tJFtVf , 
where Fz is the /-th principal minor of F. Since F is bounded, 



#y = F(v,i, Vj) = lim F(iq\m') = lim if, 



2— >oo 



2— >oo 



r(0 



In matrix form: 



lim UjFjV/ = lim H, = H. 

2— >oo 2— >oo 



(5.42) 
(5.43) 

(5.44) 

(5.45) 
(5.46) 



Since the singular value function is continuous on the space of m-by-n matrices, for any integer 
k, < k < m, we have 

lim<7 fc (H,) = <7 fc (H). (5.47) 

2— >oo 

For any integer / ^ max(m,n), define the /-by-/ augmented matrix Hz by 



Hz m> /_ n 

Oi— m,n Oz— m,2— ri 



(5.48) 



Here, Oy represents an i-by-j zero matrix, (empty when one of the dimensions is zero.) As argued 
before, the minimum control costs for the Optimization Problem (Hz,Fz) and the Generalized 
Optimization Problem (Hj,Fj,ra, n) are identical. By Proposition 



5.1 



m n 



1 



'mn 



J> fe (H0M(F,). 



(5.49) 
(5.50) 



k=l 



Moreover, the last expression in (5.50) can be realized by some control functions. 



The rank of Hz is equal to the rank of Hz, which is at most min(m, n), moreover, the first 
min(m, n) singular values of the two matrices are identical. It follows that for k > min(m, n) 



o"fc(Hz 



0. 



and the last expression in (5.50) is equal to 



min(m,n) 



cr fc (Hz)/cr fc (Fz) 



(5.51) 



(5.52) 



k=\ 
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and can be realized. The first part of the theorem then follows from: 

11 o min ( m,n ) 

-trUU T + -trVV T > lim -= V <r fc (H,)M(F { ) 

v k=l 



min(m,ra) 

^ Cfe(H)/(jjt(F). (5.53) 



>mn 

k=l 



Since F is diagonal, a k (Fi) = cr k (F) for all integers k and I, the solution to the Optimization 



Problem (H;,Fz) achieves the control cost given by the lower bound in (5.53). This completes 
the proof of Theorem 5.1. ■ 
We conclude the discussion in this section by pointing out that there is an apparent non- 



symmetry in the solution constructed in the proof of Proposition 5.1 Even if H and F are 
symmetric, the optimal solution may not be symmetric in the sense that U and V may not be 
identical. This would appear less surprising if one considers the example where H is 



1 

1 



(5.54) 



and F is the 2-by-2 identity matrix. Having a symmetric solution to the optimization problem 
would imply H is non-negative semidefinite which is obviously not the case. 

6. COST OF DISTRIBUTED ACTION 

Results in the preceding section provide an explicit formula for computing the control cost in 
the absence of any communication between the agents. However, if information can be shared 
between the agents, controls with lower average cost of distributed action can be designed. For 
the B-H system, if both agents have information on the other agent's choice, they can use the 
following control functions to realize the target, Hif 

u ij{t) = sgn(Hij)^y27THij cos(2-Kt), (6.1) 
Vij(t) = v /27rif ij sin(27rt). (6.2) 



The control energy of such a protocol is 2ir\Hij\, Using the isoperimetric inequality [25], one 
can show as in [ 30] that this is the minimum control cost to realize the single target, H^. 

Hence, if both agents have complete information of the other agent's choice, the averaged 
control cost over all possible choices (m for Alice and n for Bob) is 



„ m n 
Z7T x — ■> x — ■> 

mn 



J(H) = — 2^2JiU (6.3) 

mil t—^ t—^ 

=1 3=1 

for the B-H system. One can compare this control cost with the control cost for a single round 



protocol defined in (2.9) and note that while the number of control input pairs available to Alice 



and Bob is the same in both cases, the number of distinct controls used by each is typically not. 



February 2, 2012 



DRAFT 



24 



In order to enable a concrete comparison between single round protocols and protocols based 
on completely shared information we will use the B-H system as an example. First of all, we 
consider the case that H is a Hadamard matrix of order n. Since the entries of such matrices 
are either 1 or -1, the averaged control cost with perfect information J(H) is simply 2n. On the 
other hand, the singular values of H are all equal to y/n. Thus, by Corollary 5.1 the optimal 
single round protocol cost Cp B (H) is 

^^(n + 2) for even n, 

+ 2 + for odd n. 

Thus, the ratio of two control costs is asymptotically (n + 2)y/n/A. 

We can perform a similar comparison for the case of orthogonal matrices. To do so, we need 
to quote the following bounds on the sum of the absolute value of the entries in an orthogonal 
matrix. These elementary results on matrices are provided here for the sake of completeness. 

Proposition 6.1. Consider B-H system. For an orthogonal H, 

<-rJ2 Hij < -=. (6.5) 



n n 2 ^— ' ' " ' \/n 



For any square matrix H, 



n 2 ' — ' ' n 

i,j=l k=l 



Proof: To prove the lower bound when H is an orthogonal matrix, use the inequality 

n n 

J2 Aj >J2 lI i- n - (6 - 7) 

i,j=l i,j=l 

To prove the upper bound, consider the optimization problem, 

rt 

S = max Xij (6.8) 

subject to the constraint that for all j, J^^itfj = 1- Since the constraint is weaker than the 
requirement that the Xj/s form an orthogonal matrix, it follows that 



\Hij\ < S = n max ^^ z i = n\/n. (6.9) 



z I + -- + z n , 

i,3=l i=l 



For the case where H is a general square matrix, decompose it by SVD so that 

(6.10) 



H = $A0, A 



oi(H) •■■ 
tr 2 (H) •■■ 



••• <r n (H) 
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where <3? and © are orthogonal. For k, 1 < k < n, define an n-by-n matrix, H(A;), by 

HO) = $A fe e, A k = dia#[0 i _^, ( 7 fe (H),0 i _^, (6.11) 

k—l n—k 

Then, 

E^=il^(%l = ^(H) Ey=il**e w | 
< ^(H) EILi \®ik\ Eti < ^(H). 

From this, it follows that 

ij=l fc=l i,j=l k=l 



(6.12) 



This result together with Corollary 5.1 imply the following 



Theorem 6.1. For the B-H system and a general n-by-n target matrix H, 

9 n 

C Pb (H) - J(H) > — y)(rV2l - l)^fc(H) > 0. (6.14) 

fc=i 

r/ze difference in the control energy is strictly positive if the rank of H is larger than 2. 
If H is an orthogonal matrix, then the optimal single round cost CV(H) is 

(6.15) 



|(n + 2) for even n, 



2 



(n + 2 + -) foroddn. 



By comparison, 

— < J(H) < (6.16) 



For example, consider the case where n = 2. The identity matrix, I 2 , incurs an averaged cost 
of 7r under information sharing: there are two entries whose computation requires control energy 
of 27r; for the other two entries, H 12 and H 2 i, zero control can be used. However, for the case 
where the information on the choice is not shared, it is not possible to save control cost by 
setting any of the controls to zero, resulting in a cost increase by a factor of 2. In general, the 
control cost ratio grows super-linearly as a function of the dimension of target matrix. 



7. MULTI-ROUND PROTOCOLS 

Analysis from the previous section indicates that control cost can be substantially reduced by 
using protocols that allow communication between the agents. For the extreme scenario where 
the agents have complete prior information of each others inputs, optimization of the control 
cost can be reduced to solving a family of single output target optimization problems. For any 
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m-by-n target matrix H and a general bilinear input-output mapping, F, the averaged control 
cost for solving a family of single output target optimization problems is given by: 

„ m n 

J(H) = T^vWl^l- (7.1) 



This is a generalization of equation (6.3). 



If the agents do not have prior information on the inputs to be selected, they can communicate 
their input choice to each other in a multi-round protocol. Detailed analysis of general multi- 
round protocols lies beyond the scope of this paper. We shall briefly consider two-phase protocols, 
however, in which one phase allows partial information to be shared at negligible cost. The main 
result here is that if the cost of signaling certain partial information is negligible, the control 
cost can be made arbitrarily close to the lower bound, J(H) that was determined in the previous 
section. It is important to note that the protocols approaching J(H) do not necessarily require the 
agents to communicate their choices completely to each other. It will be shown that the number 
of bits communicated is related to the classical communication complexity of computing H. 
This result provides insight into the value of information in terms of the control energy savings 
that can be achieved through communication between the agents as they evaluate H. 

Various concepts of communication at negligible cost can be considered. One possible ap- 
proach to partial information exchange in a two-phase protocol is to use a side channel in 
an initiation phase in which it is assumed that there is no cost of transmitting some partial 
information between the agents. It is in the second phase that this information is used by the 
agents to select controls that effect the computation specified in the single-round protocols of 
Section 5. Another approach to information sharing at negligible communication cost is to assume 
that under certain circumstances, very low cost control signals can be used. Formally, for the 



model (2.3) described in section 2, we introduce the following concept of e-signaling capability. 



Definition 7.1. Alice possesses t-signaling capability around the initial state x if for any e 

there exist times, < t\ < t<i and controls u\ and U2 for Alice, V\ for Bob, so that 

\ (u\ + v\)dt < e, / (uj + v\)dt < e. (7.2) 
Jo Jo 

Moreover, when u is set to U\ or u 2 and v is set to V\, the following conditions hold: 

1) x Ul ^(ti) ^ (h), 2) (t 2 ) = x . 

One can define similar capability for Bob. For example, for the B-H system both agents possess 
e-signaling capability as the loop controls can enclose arbitrarily small areas. For systems in 
which both agents have e-signaling capability one can design multi-round protocols for realizing 
H with control cost arbitrarily close to .7(H). These protocols consist of two phases. In the first 
phase, the agents communicate their choices of inputs to each other. Based on the information 
received, the original target matrix is decomposed into a finite number of sub-matrices and 
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controls can then be applied to realize the sub-matrix that corresponds to the choices of the 
agents. We call such a protocol a two-phase protocol. To describe the detail, we recall the 
definition of a monochromatic matrix, (see for example Ifl6l0 . 

Definition 7.2. A sub-matrix is said to be monochromatic if all its entries have the same value. 

Given an m-by-n target matrix, H, we can define a set of sub-matrices, {Hi, . . . ,Hjf}, so 
that Hfc is an m k -by-n k sub-matrix with its entry defined by 



H k (i,j) = ///,,,,,.. 



(7.3) 



where t kji lies in an index set M. k c {1, . . . , m} and s k j lies in an index set M k C {1, . . . , n}. 
Note that by definition, \M k \ = rn k , \Af k \ = n k . Define l k = min(m k ,n k ). 
The set of sub-matrices forms a matrix partition for H if the following holds: 

1) For any i and j with 1 ^ i ^ m and 1 ^ j ^ n, there exist k, a, and (3 such that 

tk,a = h S ky p = j. 

2) If t k ,a = tk>,a>, s k) p = s k >^>, then k = k', a = a', and (5 = f3' . 

For example, the following figure shows a sub-matrix partition involving five sub-matrices: 



#11 


#12 


H21 


#22 


#31 


#31 


#41 


#41 



#13 


#14 


#23 


#24 


#33 


#34 


#43 


#44 



(7.4) 



It follows from direct verification that for all A; 



(7.5) 
(7.6) 



£ oj»(H fc ) = trH.Hl = £ ]T # fc 2 (U) 
;=i i=i j=i 

EY^ H 2 

Thus, for a sub-matrix partition into K sub-matrices, 

K Zfe m n 

k=l 1=1 i=l i=l 

A sub-matrix partition can be regarded as a decomposition of a complex distributed control 
problem into simpler sub-problems. We can estimate the effectiveness of a decomposition by 
calculating the control cost averaged over the decomposed sub-problems, namely, 

1 K 

A = —y2m k n k C F (n k ). (7.8) 



(7.7) 



k=l 
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The following result provides a lower bound for this averaged control cost. 

Theorem 7.1. Consider a bounded, bilinear input-output mapping, F, with a regular matrix 
representation F. The average control cost, A, for applying single round protocols to the sub- 
matrices in a sub-matrix partition {H 1; . . . , ofH satisfies the lower bound: 

^ k 2 m n 

A = Vm fc n fc C' F (H fe ) > — V V |# fi |. (7.9) 

mn ^ mno i(F) ^ ^ 

k=i iV ' i=l j=l 

If all the sub-matrices are monochromatic, this lower bound is the infimum value of A. 
Proof: 

1 K 

A = m k n k C F (Hk) 



k=l 

K l k 



2 K / h 



mno 'i(F , | | 



(7.10) 



> VVw^y>*(H fe )M(F) (7.11) 

k=l 1=1 

2 K lk 

> — ^m k n k o-j(Hfc) (7.12) 

mno 1 (F) ^ 



1/2 

^ "^ E l m ^E^ H *) ) ( 7 - 13 ) 



mno 1 (F) k=1 v 

2 K / m fc n fe \ V2 

= — ^vE ^EE^ 2 ^') < 7 - 14 ) 

K m k n k 

^EEEwa cms) 

iV ' fc=i i=i j=i 

~ m n 

EE < 7 - 16 ) 



(7.15) follows from the well-known inequality that for any real numbers, (xi, . . . ,x pJ 

2 



pJ2 x ^ E^) < 7 - 17) 

i=l \i=l / 

with equality holding if and only if all the x/s equal to each other. 

According to Theorem 5.1 the infimum value of A is given by the right-hand- side of ( |7.1 1[ ). If 
all the sub-matrices are monochromatic, then oi(H k ) if I > 1. Hence inequalities (7.12), (7.13), 
and (7.15) all become equality, and the last expression in (7.16) is the infimum value. ■ 

In the first phase of a two-phase protocol, the agents communicate with each other via 
the dynamic system by means of e signals. The bit sequence defined by the communication 



February 2, 2012 



DRAFT 



29 



complexity protocol can be regarded as an algorithm to identify the chosen sub-matrix in a 
given partition. We can visualize the algorithm by moving down a binary tree, so that depending 
on the value of the bit sent by either one of the agents in the communication protocol, we 
descend from a given node to its left or right child. For additional details about communication 
protocol,we refer to ifToll or ll29l . To make explicit contact with [16J, suppose that each sub- 
matrix in the given matrix partition is monochromatic. The number of leaves in the binary 
tree that defines the protocol is equal to the number of sub-matrices defining the partition, and 
each of these sub-matrices is mapped to one of the leaves of the binary tree. The maximum 
number of bits communicated in the protocol is equal to two times the depth of the tree. (Since 
the communication has to pass through the dynamic system, if Alice wants to send one bit of 
information to Bob, the bit has to pass from Alice to the dynamic system and from the dynamic 
system to Bob, leading to two communication bits being exchanged.) The protocol complexity 
is thus defined as two times the height of the binary tree and can provide an upper bound for 
control communication complexity. 



For illustration, consider a target function, 

H = 



1111 
115 5 
2 3 5 5 
2 3 5 5 



(7.18) 



The sub-matrix partition shown in (7.4) represents a decomposition of the target matrix into 
the minimum number of monochromatic blocks. (Such minimal decomposition is not unique.) 
One can define a communication protocol to identify the components of the matrix partition 
(monochromatic blocks in this case) as follows. 

Protocol to communicate the structure of a matrix partition: Assume that Alice controls the 
choice of columns and Bob controls the choice of rows. 

1) Alice sends a bit to Bob with value if she chooses the first 2 columns, otherwise she 
sends 1. 

2) Upon receiving a bit of value 1 from Alice, Bob sends a bit to Alice with value if he 
chooses the first row, otherwise, he sends a bit of value 1. No further bit needs to be sent. 

3) On the other hand if a bit of value is received from Alice, Bob sends a bit to Alice with 
value if he chooses the first two rows, otherwise he sends a bit 1. Only in the latter 
case, Alice sends one more bit, with value if she chooses the first column and 1 if she 
chooses the second column. 

The communication protocol can be represented by the binary tree shown in Figure [T] A 
maximum of six bits (counting bits sent by the dynamic system) are needed in this protocol in 
order to guarantee all sub-matrices can be identified. 

In our two-phase protocol, once the communication phase is completed — which is to say 
that the phase-one protocol has run to completion and a leaf node identifying a sub-matrix has 
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been reached, the second phase of the protocol starts. The target matrix that is collaboratively 
evaluated by the agents in phase two is the sub-matrix that had been selected in phase one. It is 
assumed that in phase one, communication of negligible cost (e.g. e-signaling) occurs, but that 
in phase two, open-loop controls of the form described in Section 5 realize the output specified 
in the chosen sub-matrix. It is clear that one can construct two-phase protocols with total control 
cost arbitrarily close to ( |7.1| ), the lower bound for all protocols realizing H. We summarize the 
results in this section in the following theorem. 

Theorem 7.2. Consider a bounded, bilinear input-output mapping, F, with a regular matrix 
representation F. Suppose both agents have e-signaling capability around the initial state. Let 
H be an m-by-n target matrix. The infimum control cost of any multi-round protocol that realizes 
H is given by: 

„ m n 

J(H) = t^tW I #»|- (V.19) 

v ; ^W F ) ~™ 



By comparing the control cost of an optimal single round protocol as given by (5.1) with 



that of the multi-round protocol (7.19), one can estimate the value of communicated bits in 



reducing the control energy cost. Using the target matrix H of (7.18) as an example, for the 



B-H system the minimal control cost without using any communication is defined in equation 
\5.2\ and has a value of 7.687T. If communication cost is negligible then the minimal control cost 



is approximated by the right-hand- side of (7.9) and has a value of 2.887T. This can be achieved 



by the information sharing protocol shown in Figure [TJ which has a protocol complexity of 6 
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(= two times the length of the binary tree). The protocol complexity is a "worst case" metric, 
reflecting the maximum number of bits that might need to be communicated. Thus, the value of 
a single communication bit in reducing the control energy cost for this problem is at least 0.87T 



with the units being control energy (as defined by (2.8)) per bit. 



8. CONCLUSION 

This paper has continued our study of problems in control communication complexity, which 
may be viewed as an extension of classical communication complexity with the additional focus 
on control cost. There are several important application contexts in which the optimization 
problems of the type we have considered seem to arise naturally. The single round protocols 
for steering the B-H system realizes the solution to a problem in distributed computing where 
independent agents act to evaluate a function without foreknowledge of each other's choices. As 
was noted in Q, the problems also arise naturally in what we have called the standard parts 
optimal control problem in which it is desired to find a specific number, m, of control inputs 
to a given input-output system that can be used in different combinations to attain a certain 
number, n, of output objectives so as to minimize the cost of control averaged over the different 
objectives. The connection with quantum control, and in particular, the control of quantum spin 
systems (see for example the references to quantum control and computation in 113010 is another 
interesting line of investigation that is under way and will be treated elsewhere. 
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Appendix A 

To prove the rank of a matrix representation of a bilinear input-output mapping is well-defined, 
let F be a representation corresponding to the bases {e 1; e 2 , . . .} and {fx, f 2 , . . .}. Let F' be a 
representation corresponding to the bases {ei, ■ ■ ■} and {/{, f 2 , . . .}. Assume first that F has 
finite rank equal to r. By relabeling the indices, we can assume without loss of generality that the 
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first r columns of F are independent. Hence, there exists a matrix C with r rows and infinitely 
many columns such that 



(8.1) 





F\2 


F13 




F21 


F22 


F23 




F31 


F32 


F33 




' F u 


F\2 




F\r 


F21 


F22 




F2r 


F31 


F32 




F% r 



C = FC. 



(8.2) 



By relabeling the indices of the basis {e 1: e 2 , ■ ■ ■} if necessary, we can assume that the first r 
rows F are independent. Label the sub-matrix of F formed by the first r rows by D. Note that 
D is an invertible r-by-r matrix and 



C = D 1 



Fu F\2 F13 
F21 F22 F23 



F r i F r 2 F, 



r3 



(8.3) 



That is, for 1 < k < r and 1 < I 



Cm - y~]DtoFu 



(8.4) 



Let Co be the upper bound of the entries in D 1 , so that for all 1 < i, j < r, D^ 1 < C < 00. 
For any positive integer j, the base vector, f'-, has a representation of the form 

00 

/; »,//,. (8.5) 

1=1 

The coefficients are square summable so that Yliti a % < 00 • F° r an y 1 ^ * ^ r ' define 

00 

9ij = ^2ajisgn(a j iF il )f l . (8.6) 
i=i 

Since the coefficients for are square summable, this is a well-defined element in C. Moreover, 

00 

Fie^gij) = ^2 \ a ji F u\ < 00, (8.7) 
;=i 

since F is bounded. On the other hand, 

00 00 r 

F[ 3 = F(e h /;■) = OLjiFu = Uji E F * C "- ^ 



1=1 



1=1 k=l 
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Formally, by exchange the summation order, the right-hand- side of equation ( |8.8[ ) is equal to 

r oo r oo 

F ' ik a i l ^ kl = FikEjk, Ej k = QjiCki- 



(8.9) 



k=l 1=1 



k=l 



1=1 



The equality holds if the expression in (8.9) is convergent. By means of equation (8.4), 

oo r r oo 

\E jk \ <j2J2\ a n D £ F a\ < c j2J2\ a * F »\- ( 8 - 10 ) 



1=1 i=l 



8=1 1=1 



It follows that the expression in (8.9) is convergent and 



F'ij — Oiji FikCki — Fik oijiCki- 



(8.11) 



fc=i 



fc=i 



Then equation (8.11) can be rewritten as F' = FE, E = [Ey] . Therefore, the rank of F' is 
equal to or less than r. By reversing the roles of F and F', the ranks of the two matrices must 
be identical if one of them has finite rank. Thus, if the basis {ei,e 2 , . . .} is fixed the rank is 
independent of the other basis. Similarly, one can argue that if {/i, / 2 , . . .} is fixed, the rank is 
independent of the other basis. 

In the general case, consider matrix representation, F, with respect to bases {ei, e 2 , . . .} and 
/2, • • and matrix representation, F", with respect to bases {e' 1; e' 2 , . . .} and {/{, f 2 , . . .}. 
If the rank of F is finite, it is equal to the rank of the matrix representation with respect to the 
bases {ei, e 2 , . . .} and {/{, fy, ■ ■ .}, which in turn is equal to the rank F". It is easy to see that 
it is not possible for one representation to have finite rank while the other has an infinite rank. 
Thus the rank is independent of the choice of the bases. 

Appendix B 

Let {ei, e 2 , . . .} and {/i, f 2 , . . .} be orthonormal bases corresponding to the representation F. 
Let v = Y^\=i v jfj b e an element of C and v T = [vi, . . . , v{\ be the corresponding /-dimensional 
vector. Let u T = v T Ff and denote the corresponding element ^ i=1 UiCi in £ by u. Then, 



F(u,v) = u T F^v = v T FfF ; v. 



Using the assumption that F is bounded and the inequality (2.6), one can show that 

||Fiv|| 2 = v T FfF ; v = F(u,v) < ||F||||v||||F/v|| 

< ii^iriivii 2 . 

Let Wj be the eigenvector of F^F ; with unit norm that corresponds to 0i(Fj). Then, 



tr?(F,) = wfFfF^w, = mwAl 2 < \\F\\ 2 . 



(9.1) 

(9.2) 
(9.3) 

(9.4) 
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